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1. We, Drs. Kathy Hancock and Victor C. W. Tsang are co-inventors named in the 
above-referenced patent application. 

2. We have read and understand the above-referenced patent application, including the 
pending claims, and the Office action dated April 15, 2004. 

3. It is our understanding that in Paragraph 8 of the Office action of April 1 5, 2004, 
claim 23, which concerns a composition that includes SEQ ID NO: 4 (TS-18) having one or more 
conservative substitutions, was rejected as not sufficiently enabled by the specification. As shown in 
our manuscript presented in Exhibit A (Hancock et al. J. Clin. Microbiol. 41 :2577-86, 2003), 
conservative as well as non-conservative substitutions can be made within a TS-18 protein sequence 
without significantly altering the immunoreactivity of the protein with T. solium antibodies present in 
patients with cysticercosis. 

4. Figure 1 of Hancock et al. shows an alignment of partial TS-18 sequences, 
demonstrating which amino acid residues differ between wild-type (GenBank Accession No. 
AF082828) and variants thereof (such as varl (AF098073), var3 (AF098075), var4 (AF356331) and 
var 8 (AF356345)). As shown in Figure 1, some of the amino acid residues that differ between wild- 
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type and variant TS-18 sequences are more conservative substations (in some examples indicated by 
two dots, such as K->E and E->D), while others are less conservative (in some examples indicated by 
one dot, such as K->N and F->L). 

5. To demonstrate that variant and wild-type TS-1 8 protein sequences have similar 
biological activity, immunoassay methods known at the time the present application was filed 
(Hancock and Tsang, J. Immunol, Methods 92:167-76, 1986) and disclosed in the present application 
(see page 19, line 20- page 21, line 29) were used to analyze the immunoreactivity of recombinant 
TS-18 and variants thereof. Figure 4 of Hancock et al shows the antibody reactivity of wild-type 
TS-18 compared to TS-18 varl, var3, var4 and var8. Although TS-18 var6 did not have the desired 
activity (data not shown), the other four variants tested had similar J index values (Figure 4). The J 
index is equal to the sensitivity plus the specificity minus 1. Therefore, TS-18 sequences that contain 
one or more conservative or non-conservative substitutions as disclosed in the present application 
were shown in Hancock et aL to have similar biological activity. 

6. We hereby declare that all statements made herein of our own knowledge are true and 
that all statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. § 1001, and that such willful false 
statements may jeopardize the validity of the application or any patent issued thereon. 
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Characterization of the 8-Kilodalton Antigens of Taenia solium 
Metacestodes and Evaluation of Their Use in an Enzyme-Linked 
Immunosorbent Assay for Serodiagnosis 
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Sowmya Pattabhi, John Noh, and Victor C. W. Tsang 

Division of Parasitic Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia 30341 
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The Western blot for cysticercosis, which uses lentil lectin purified glycoprotein (LLGP) antigens extracted 
from the metacestode of Taenia solium , has been the "gold standard" serodiagnostic assay since it was first 
described in 1989. We report that the diagnostic antigens at 14, 18, and 21 kDa, as well as some larger 
disulhde-bonded antigens, are actually all members of a very closely related family of proteins, the 8-kDa 
antigens. The genes for 18 unique, mature proteins have been identified. Nine of these were chemically 
synthesized and tested in an enzyme-linked immunosorbent assay with a battery of defined serum samples, 
including 32 cysticercosis-positive serum samples reactive with the 8-kDa antigens of LLGP on Western 
blotting, 34 serum samples from patients with other parasitic infections, and 15 normal human serum samples. 
One of the 8-kDa antigens, TsRSl, is 100% sensitive and 100% specific. TsRSl will be one component of a 
cocktail of three to four synthetic or recombinant antigens, based on the diagnostic bands of the Western blot, 
which will be used for the serodiagnosis of cysticercosis. 



Neurocysticercosis is a debilitating disease caused by the 
metacestode of the parasitic worm Taenia solium. Humans are 
an accidental host for the metacestode, or larval form, of T. 
solium. Infection occurs when eggs are ingested via the fecal- 
oral route of transmission. Infective eggs hatch, and the liber- 
ated oncospheres cross the membrane of the small intestine 
and migrate in the body, typically ending up in the central 
nervous system, skeletal muscle, subcutaneous tissue, or ocular 
tissue (9, 10, 43). Within the host's tissues, the oncosphere 
matures into a cysticercus and causes the disease cysticercosis. 
Neurocysticercosis is an infection of the central nervous sys- 
tem. Symptoms are nonspecific and variable and include sei- . 
zures, headaches, and hydrocephalus, depending upon the 
numbers, locations, and viabilities of the cysts and the stage of 
the host immune response (4, 44, 45). Several studies have 
shown that in countries where the disease is endemic, as many 
as 18 to 50% of the cases of adult-onset epilepsy are actually 
cases of neurocysticercosis (11, 22, 26). 

In the life cycle of T. solium, humans are the definitive host 
for the adult worm and pigs are the intermediate host for the 
metacestode. Maintenance of the life cycle requires a close 
association between humans and pigs; consequently, this dis- 
ease is found wherever pigs are allowed to roam freely to 
scavenge for food and humans lack adequate sanitary facilities 
for the disposal of feces. Cysticercosis is endemic in Latin 
America, India, Asia, and Africa (14, 36). A recent estimate of 
the prevalence of cysticercosis in Latin America indicated that 
400,000 people have symptomatic disease (3). 

Diagnosis of neurocysticercosis requires either imaging of 
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the brain by computed tomography or magnetic resonance 
imaging or the demonstration of cysticercosis-specific antibod- 
ies in the patient's serum. Brain imaging is expensive and is 
usually available only in urban centers within the countries 
where the disease is endemic. In addition, single lesions due to 
Mycobacterium tuberculosis are often difficult to distinguish 
from cysts due to T. solium (4, 9). Early antibody detection 
assays lacked sensitivity and specificity (7, 15, 29, 31), but in 
1989, an assay with 98% sensitivity for the detection of cystic- 
ercosis cases with two or more cysts, less sensitivity for the 
detection of cases with single cysts (26, 28, 46), and 100% 
specificity was developed (40). The assay uses a partially puri- 
fied preparation of metacestode antigens. Cysts harvested from 
pigs are solubilized in urea, and the bound fraction from lentil 
lectin affinity chromatography, the lentil lectin purified glyco- 
proteins (LLGPs), is used as antigen in a Western blot or 
enzyme-linked immunoelectrotransfer blot assay. Immunore- 
activity with any one of seven glycoproteins is diagnostic for 
cysticercosis. Since 1995, this has been the only assay recog- 
nized by the World Health Organization and the Pan Ameri- 
can Health Organization for the serodiagnosis of cysticercosis 
(Pan American Health Organization and World Health Orga- 
nization informal consultation on the taeniosis/cysticercosis 
complex, 1997). The assay, however, has some drawbacks. It is 
dependent upon a supply of naturally infected pigs. Prepara- 
tion of the antigen and performance of the Western blot re- 
quire considerable technical expertise. The partially purified 
LLGP antigen preparation is not suitable for use in an enzyme- 
linked immunosorbent assay (ELISA) (V. C. W. Tsang, un- 
published data); and a Western blot assay is not suitable for 
field studies, nor is it a suitable or affordable assay for diagnosis 
in countries where cysticercosis is endemic. To address these 
issues, we have been systematically characterizing the seven 
diagnostic LLGP antigens. 
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The characterization of two LLGP proteins, Tsi4 and Tsl8, 
has been reported earlier (16, 17). Here we report on the 
identification and characterization of a family of diagnostic 
proteins, the 8-kDa antigens of T, solium metacestodes. The 
8-kDa antigens are the diagnostic proteins seen at 14, 18, and 
21 kDa on the Western blot and are also found in the bands at 
24 and 39 to 42 kDa. Eighteen unique mature proteins have 
been cloned by us and others (16, 24, 34) and were identified, 
by phylogenetic analysis, to sort into four cladcs. Nine were 
chemically synthesized for use as antigens. Testing of the syn- 
thetic proteins in an ELISA identified one 8-kDa protein with 
100% sensitivity when it was tested with sera from cysticercosis 
patients reactive with the 8-kDa protein components of LLGP 
on Western blot and 100% specificity. 

MATERIALS AND METHODS 
Parasite material and DNA extraction. T. solium cysticerci from Peru, India, 
and China were dissected from surrounding porcine muscle. For each cyst, the 
protoscolex was removed by dissection and washed with cold phosphate-buffered 
saline, and the DNA was extracted by using the Fast DNA kit with lysing matrix 
4 and CLS-TC buffer, according to the instructions of the manufacturer (Qbio- 
gene, Inc., Carlsbad, Calif.). For the cysts from India and China, which had been 
preserved' in 70% ethanol. an overnight incubation step at 37*0 was added alter 
the homogenization step to allow rehydration of the DNA. The DNA isolated 
from the cysts preserved in ethanol was further purified by using the QIAquick 
PCR purification kit (Qiagen, Carlsbad, Calif.), according to the instructions of 
the manufacturer, to remove PCR inhibitors. 

Amplification, cloning, and sequencing or the 8-kDa diagnostic antigens. The 
8-kDa diagnostic antigens were amplified by using two sets of primers. Primers 
gTsUF (5 ' - ATGCGTGCCrACATTGTGClTCTC-3 ') and gTsl4R2 (5'-GCA 
GTTlTrrTCTT AGG ACCTTTG CAGTG - 3 ' ) amplified the gene for TslA The 
genes for the other 8-kDa proteins were amplified by using primers gTsl4F and 
g Tsl4Rl (5'-GTGAAGAGAAGAACGCATGAAAGTTG-3'). All PCRs were 
done with Pfu polymerase (Stratagene, La Joila, Calif.) at an annealing temper- 
ature of 60°C for 40 cycles. The amplicons were cloned into the vector PCR- 
Scripl (Stratagene) according to the instructions of the manufacturer. From 4 to 
14 clones of each ampiicon were sequenced. In addition, the amplicons resulting 
from amplification of DNA from the Peruvian isolate, the Indian isolate, and the 
China isolate with gTsHF and g Tsl4R2 were directly sequenced. In all cases, 
both strands of DNA were sequenced. All sequencing was done by terminator- 
based cycle sequencing with BIGDYE fluorescent dye (Applied Biosystems, 
Foster City, Calif.) (35) and an ABI Prism 377 DNA sequencer (Applied Bio- 
svstems). 

* Sequence data were analyzed with the SeqMan II program (DNASTAR Inc., 
Madison, Wis.). Sequence homology searches were done by using the BLAST 
program (1). Signal peptide sequences were predicted by using the SignaiP 
program (23) along with N-lerminal sequence data. Alignments were done by 
using the ClustalX program (39). For the phylogenetic analyses, all sequences 
were aligned, and the amino acid sequence data common to all sequences were 
used. The phylogenetic analysis was performed with the PUZZLE program (38), 
by the FITCH method (a least-squares distance method), and by the protein 
parsimony (protpars) method (htt P y/evolution.genetics.washington.edu/phylip 
.html). Phylogenetic trees were displayed and manipulated by using the Tree- 
View program (25). 

Nine of the 8-kDa proteins were chemically synthesized, all with a long-chain 
biotin at the mature N terminus for uniform capture by streptavidin-coated 
polystyrene "sticks." Proteins Tsl8, Tsl8 varl, Tsl8 var3, Tsl8 var4, Tsl8 var6, 
and TsRS2 varl were synthesized by SynPep (Dublin, Calif.), and proteins Tsl4, 
Tsl8 var8, and TsRSl were synthesized by AnaSpec Inc. (San Jose, Calif.). All 
were of at least 95% purity, as determined by high-pressure liquid chromatog- 
raphy. The mass of each synthetic protein, as determined by mass spectrometry, 
matched the theoretical mass. Synthetic proteins were solubilized in 0.05 M 
HEPES-0.1 M NaCl-2 mM dithiothreitol (pH 7.0) and stored at ~80°C after the 
addition of glycerol to a final concentration of 43.5%. The protein concentration 
was determined by the bicinchoninic acid method (Pierce, Rockford, III.) (37). 

Battery of serum samples. Our battery of serum samples included 32 T. 
jo/wm-infected serum samples from patients with neurocysticercosis, 34 serum 
samples from patients with other parasitic infections, and 15 normal human 
serum samples (NHSs). All 32 serum samples from patients with T. solium 
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infections were positive for cysticercosis by the LLGP Western blotting assay and 
all were reactive with the 8-kDa group of antigens (40). Many of the samples 
were also confirmed to be positive for cysticercosis by brain imaging or biopsy. A 
positive reference standard was formed by pooling scra.lrom three patients with 
parasitologically confirmed, immunoblot-positivc cysticercosis. The other para- 
sitic infection scrum samples were all from parasitologically confirmed cases. 
Among these serum samples were 4 scrum samples from patients with Taenia 
saginaia infections. 3 serum samples from patients with Echinococcus muhilocu- 
laris infections, 6 serum samples from patients with Echinococcus granulosus 
infections. 11 scrum samples from patients mlhAscaris iumbricoUtes infections, 
4 serum samples from patients with Schistosoma mansoni infections, 4 serum 
samples from patients with Schistosoma haematobium infections, and 2 serum 
samples from patients with Tnchinelta sp. infections. The NHSs were obtamed 
from healthy U.S. residents who did not have a history or international travel. 

Fast-ELISA, Fasl-liLISA was performed as described previously (18). Strcpta- 
vldin (recombinant streptavidin; Roche Molecular Biochemicals, Indianapolis, 
Ind.) was used to coat the polystyrene sticks of the Falcon 3931 assay screening 
test (Becton Dickinson, Lincoln Park, N J.) at a saturating concentration of 6.0 
u.g of streptavidin per ml of 0.05 HEPES- 0.1 M NaCI ( P H 7.0). The wash buffer 
and diluent buffer for the biotinylated proteins and conjugate was 0.05 M 
HEPES-0.1 M NaCI (pH 7.0) plus 0.3% Twccn 20 (Calbiochem-Novabiochem 
Corp., La Jolla, Calif,). The serum diluent consisted of the same buffer with 5% 
nonfat dry milk. The biotinylated proteins were diluted to 1.0 u-g/ml. The serum 
samples were assayed in triplicate at 1:100 dilutions. Bound antibodies were 
detected with a pcroxidase-labeled goat anti-human immunoglobulin (41), fol- 
lowed by the substrate SureBlue (Kirkegaard & Perry Laboratories, Inc., Gaith- 
ersburg, Md.). All reagents except the sera were present in excess. After 5 mm, 
the A bW was read with a ThermoMax microplate reader (Molecular Devices 
Corp., Sunnyvale, Calif.). Each plate contained a positive reference standard, 
which' was used to calculate a relative absorbance unit for each sample by 
dividing the mean A (M) for each sample by the mean for the positive 
reference standard. As a control, the positive reference standard was included as 
a sample in the assay. If the relative absorbance for the control differed from 1.0 
by more than 10%, the data for the plate were discarded. 

Nucleotide sequence accession numbers. The nucleotide sequences identified 
in this study have been submitted to GenBank, and the accession numbers are 
given in Table 1. 



RESULTS 

The 8-kDa gene family. A family of genes encoding low- 
molecular-weight antigens has been identified in T. solium 
metacestodes. To date, 32 nucleic acid sequences have been 
identified as encoding members of this 8-kDa family of pro- 
teins (Table 1). The 32 sequences are from T. solium cysts 
collected in Peru, Mexico, India, and China. A number of these 
sequences have already been described (16, 24, 34). Additional 
members of this gene family, described in this paper, were 
identified by PCR amplification of genomic DNA. Two introns 
were identified in each of the genomic clones. Screening, at a 
low stringency, of a bacteriophage lambda T. solium metaces- 
tode cDNA library with a mixture of random primed and 
labeled probes generated from the coding sequences of Tsl4, 
Tsl8, and TsRSl, did not identify any additional gene family 
members (data not shown). 

Twenty-six unique sequences that encode 23 unique protein 
sequences were detected among the 32 nucleic acid sequences. 
These 8-kDa proteins have a signal sequence, cleavage of 
which results in a 66- or 67-amino-acid mature protein with a 
predicted molecular mass of about 8 kDa (x = 7,628 for the 
66-amino-acid proteins and 7,746 for the 67-amino-acid pro- 
teins) and a predicted pi of about 9 (x = 8.96). There are 18 
unique mature protein sequences. Within the mature protein 
sequence, all of the 8-kDa proteins have from one to three 
predicted N-linked glycosylation sites, and all of the 8-kDa 
proteins have one to three cysteines except for the proteins 
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Origin 


Source 
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AF082829 
AF 158 1 84 
AF257776 


TsH 
Tsl4 
Tsl4 


mRNA 

Genome 

mRNA 


Peru 
Peru 
Mexico 


16 
16 

24 


AF356335 
AF356336 


Ts)4 
Tsl4 


Genome 
Genome 


China 
India 


This study 
This study 


AF356337 


TsJ4 varl 


Genome 


India 


This study 


AF356338 


Tsl4 var2 


Genome 


India 


This study 


AF356339 


Tsl4 var3 


Genome 


India 


This study 


AF098073 
AF356330 


Tsl8 varl 
Tsl8 varl 


mRNA 
Genome 


Peru 
Peru 


16 

This study 


AF098074 


Ts18 var2 


mRNA 


Peru 


16 


AF098075 


Tsl8 var3 


mRNA 


Peru 


16 


AF356331 


Tsl8 var4 


Genome 


India 


This study 


AF356332 


Tsl8 var5 


Genome 


China 


This study 


AF350070 


TsKS variant 1 


mRNA 


Mexico 


24 


AF35007I 


Tsl8 variant 2 


mRNA 


Mexico 


Enciso, direct submissi 


AF082828 


Tsl8 


mRNA 


Peru 


16 


AB044081 


TSOLAglVl 


mRNA 


China 


34 


AF356333 


Tsl8 var6 


Genome 


China 


This study 


AF356334 


Tsl8 var7 


Genome 


China 


This study 


AF356345 


TsJ8 var8 


Genome 


India 


This study 


AF356341 
AF216695 
AF1 63972 


TsRSl 
Ts21 

Immunogenic protein 


Genome 

mRNA 

mRNA 


Peru 

China 

China 


This study 

Liu, direct submission 
Bin, direct submission 


AB044083 


TSOLAg2Vl 


mRNA 


China 


34 


AR044.0X2 


TSOLAt>2 


mRNA 


China 


34 


AF082830 


TsRSl 


mRNA 


Peru 


16 


AF356340 


TsRSl varl 


Genome 


India 


This study 


AF356342 


TsRSl var2 


Genome 


China 


This study 


AB044080 


TSOLAgl 


mRNA 


China 


34 


AF356344 


TsRS2 


Genome 


China 


This study 


AF356343 


TsRS2 varl 


Genome 


Peru 


This study 



■ The nucleic acid sequences for the accession number(s) in the same cluster are identical over the shared sequence. 



within the TsRSl clade. These proteins lack N-linked glycosyl- 
ation sites and cysteines. The 8-kDa proteins are hydrophilic, 
as determined by hydrophobia ty analysis (21). They lack po- 
tential transmembrane regions and glycosylphosphatidylinosi- 
tol-anchor attachment sites (8). These data suggest that the 
8-kDa proteins are extracellular secreted proteins that perhaps 
accumulate in the cyst fluid. 



The 8-kDa proteins have been shown by protein sequencing 
to be the diagnostic proteins found at 14, 18, and 21 kDa in the 
lentil lectin bound fraction from urea-solubilized cysticerci (17, 
40; data not shown). Only 8-kDa proteins have been found in 
these diagnostic bands. Protein sequencing data also show that 
the bands at 24 and 39 to 42 kDa contain 8-kDa proteins (27; 
data not shown). In addition, a polyclonal antibody raised 
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- - MRA YIVLlaA L TVFWTVSA 


AF356335 




AF356337 




AF35G339 




AF09807 3 




AK098074 




AF098075 




AP35633 1 




AF356332 


ALTVFWAVSA 


AF3 50070 


--MR VYIVLLA LTVFWDVSA 


AF350071 


- -MRVYXVLLALTVFWAVLA 


AF082828 


YIVLLALTVFWAVSA 


AB044081 


--MF VY2VLLALTVFWA VSA 


AF356333 


ALTVFWAVSA 


AF356334 


- -ALTVFWAVSA 


AF216695 


AAARAHXLLLALSVFWAVSA 


AB044083 


- -MRAH2 L LLA LSVFWA VSA 


AB044082 


- - MRAHI LLLALSVFWAVSA 


AF082830 


HI VLLALSVFWA VSA 


AF356340 


AL1VFWAVSA 


AB044080 


- - MRA YJ VLLAL TVFA VA VSA 


AF356344 


AL TVFA VA VSA 


AF356343 


ALTVFAVAVSA 



AF082829 EKNKPKDVANSTKKGIEYVH-EFFHEDPIGKQIAQLAKEWKEAMLEDKGKIRTSLVEHCKGPKKKTA 

AF3 5633 5 EKNKPKDVANSTKKGI EYVH - EFFH EDPI SKQI AQLAKEWKEAMLEDKGKI RTSLVE 

AF3 56337 EKNKPKDVANSTKKGI EYVH - EFFHEDPI SKQI AQLAKEWKEAMLEDKGKI RTSMVE - 

AF356339 EKNKPKDVANSTKKGI EYVH - EFFHEDPI SKQI AQLAKEWKEAMLEDKDK I RTSLVE 

AF098073 EKIKPKCDTNSTKKEIEYIH^FFHDDPIGKQIAQLAKJWNETVQEAKGEIRASLAEYCRGLKNKTA 

AF098074 EKIKPKCDTNSTKKEIEYIH^FFHDDPIGKQIAQLAKNWNETVQEAKGEIRASLAEYCRGLKNKTA 

AF098075 EETKPKCDANSTKKEIEYIHNWFFHDDPIGKQIAQLAKNWNETVQEAKGEIRASLAEYCRGLKNKTA 

AF356331 EKIKPKCDTNSTKKEIEYIHNWFFHDDPIGKQIAQLAKNWNETVQEAKGKFRASLAEYCRCLKNITA 

AF3 563 32 EKIKPKCDTNSTKKEIEYIHNWFFHDDPIGKQIAQLAKNWNETVQEAKGKFRASLAEYCRCLKNITA 

AF350070 EKIKPKCDTNSTKKEIEYIHNWFFHDDPIGKQIAQLAKDWNETVQEAKGKFWASLAEYCRGLKNKTA 

AF3 50071 GKNKPKCDGNSTKKEIEIFHNWFFHDDPIGKQIAQLAKDWNETVQEAKGKFWASLAEYCRGLKNKTA 

AF082828 EKNKPKCDANSTKKEIEYIHNWFFHDDPIGKQIAQLAKDWNETVQEAKGKFWASLAEYCRGLKNKTA 

AB044081 EKNKPKCDGNSTKKEIEFIHNWFFHDDPIGNQIAQLAKDWKVAMLKAKGEIRASLAEYCRGLKNKTA 

AF356333 GENKQNGDANSTKKEIEILQNWLFHVEPIGKQIVRLAKDWNAAVLEAKGKFRKSLAEYCRGLKNITF 

• AF356334 GENKQNGDANSTKKEIEILQNWLFQVEPIGKQIVRLAKDWNAAVLEAKGKFRKSLAEYCRGLKNITF 

AF2 16695 EETKPEDWKNIKKGMEVVY-KFFYEDPLGKKIAQLAKDWKEAMLEARSKVRASLAEYIRGLKNEAA 

ABO 4 4 08 3 EETKPEDWKNIKKGMEWY-KFFYEDPLGKKIAQLAKDWKEAMLEARSKVRASLAEYIRGLKNEAA 

AB044 082 KETKPEDWKNIKKGMEWY-KFFYEDPLGKKIAQLAKDWKEAMLEARSKVRASLAEYIRGLKNEAA 

AF08283 0 EETKPEDWKNIKKGMEWY-KFFYEDPLGKKIAQLAKDWKEAMLEARSKVRASLAEYIRGLKNEAA 

AF356340 EETKPEDWKNIKKGMEWY-KFFYEDPLGKKIAQLAKDWKEAMLEARSKVRASLAEYIRGLKNEAA 

AB044080 EKNKTDGVGKSIKNWIEFVH-RFFYEDPIGKQIAQLAKDWNETAPEARCKVRTLLAENRRGLKNKTA 

AF356344 EKNKTDGVGKSIKNWIEFVH-RFFYEDPIGKQIAQLAKDV^NETAPEARCKVRTLLAENRRGLKNKTG 
AF356343 EKNKTDDVGKS I KNWI EFVH - RFFYEDPI GKQI AQLAKDWNETA PEARCKVRALLAENRRGLKNKTA 

. * * . . * .* .*...*.*«*.* ; *:: 



B 

AF2 49884 MEQETNPIRRIKTKAKEYFAARERFYDEDPLGKQI AAHLSKVJREI I RDVRARLRG YLRKYLNDLQKEY PKA 

AF312736 MEQETNP IRAI KKRITSYLKSREEFYDKDPLGQKI AKFYGEWKELVAEVRKRVRARI AAYVKKLQEE 

: : : : : :*:.::*. . * . : : 

FIG. I. Alignments of the T. solium 8-kDa diagnostic antigens obtained with the ClustalX program. (A) Alignments of the 23 unique protein 
sequences from the 8-kDa gene family, identified by GenBank accession number. The predicted signal peptides are in italics. The lines between 
accession numbers mark the division of the sequences into four groups, Tsl4, Tsl8, TsRSl, and TsRS2. Additional protein sequences that are not 
shown are those with GenBank accession numbers AF257776 and AF158184, which are identical to the sequence with GenBank accession number 
AF082829; those with GenBank accession numbers AF356366 and AF356338, which are identical to the sequence with GenBank accession number 
AF356335; that with GenBank accession number AF356330, which is identical to the sequence with GenBank accession number AF098073; that 
with GenBank accession number AF356345, which is identical to the sequence with GenBank accession number AB044081; and those with 
GenBank accession numbers AF356342, AF163972, and AF356341, which are identical to the sequence with GenBank accession number 
AB044083. For this analysis, an intron-like sequence in the sequence with GenBank accession number AB044083 was removed. (B) Alignments 
of the 8-kDa proteins with the hydrophobic Iigand-binding proteins of H. diminuta (GenBank accession number AF249884) and M. expaiisa 
(GenBank accession number AF312736). Asterisks mark identical amino acid residues. More and less conservative substitutions are indicated by 
two dots and one dot, respectively. 



against Tsl4 reacts with the diagnostic bands at 14, 18, 21, 24, 
and 39 to 42 kDa (16). Reduction of the proteins in the diag- 
nostic band from 39 to 42 kDa with dithiothreitol yields pro- 
teins migrating at 14, 18, and 21 kDa, suggesting that the 8-kDa 
proteins in the band from 39 to 42 kDa are disulfide-bonded 
multimers (17, 27; data not shown). 



The 23 sequences were aligned by using the ClustalX pro- 
gram (Fig. 1A). The predicted signal peptide sequences are 
shown in italics and, as expected, are well conserved. Within 
the mature sequences, there are 12 absolutely conserved amino 
acids, 20 positions with more conservative substitutions, and 7 
positions with less conservative substitutions. Among these 23 
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FIG. 2. Phylogenetic irec lor the 8-kDa diagnostic antigens of T. 
solium. The analysis was done by quartet puzzling using a Clustal 
alignment of 73 amino acids from the 16 8-kDa proteins that are 
unique over the aligned sequence. The sequences are identified by 
GenBank accession number. The dashed ovals show the grouping of 
the sequences into four clades, TsM, TsJ8, TsRSl, and TsRS2. The 
scale bar indicates an evolutionary distance of 0.1 amino acid per 
position. 

protein sequences, 18% of the amino acids are identical and 
68% are similar or identical. Many of these identical and sim- 
ilar amino acids are clustered in the central portion of the 
mature protein, from position 27 through position 41. 

From the alignment of the 8-kDa protein sequences ob- 
tained with the ClustalX program (Fig. 1A), the 16 sequences 
with an alignment over 73 amino acids were analyzed to de- 
termine their phylogenetic relationships. Quartet puzzling (a 
maximum-likelihood method), the FITCH method (a least- 
squares distance method), and the protpars method (a maxi- 
mum-parsimony method) gave trees with similar topologies. 
The quartet puzzling results are shown as a radial tree in Fig. 
2. The 8-kDa sequences fall into four clades, with the largest 
and most diverse clade being Tsl8. The other clades are Tsl4, 
TsRSl, and TsRS2. The node support values for the four 
clades range from 90 to 99%. All four clades contain sequences 
from cysts from both Latin America and Asia. 

A BLAST search for similar sequences identified 13 homol- 
ogous proteins. The two proteins with the highest Expect val- 
ues were cestode hydrophobic ligand-binding proteins, one 
from Hymenolepis diminuata (GenBank accession number 
AF249884) and the other from Moniezia expansa (GenBank 
accession number AF3 12736). These two proteins were aligned 
with the 8-kDa proteins by use of the ClustalX program (Fig. 
IB). These hydrophobic ligand-binding proteins do not have a 
signal sequence, but their sequences align with the mature 
8-kDa sequences. Among the 8-kDa proteins and the hydro- 
phobic ligand-binding proteins, there are 3 conserved amino 
acids, 1 1 positions with more conservative substitutions, and 9 
positions with less conservative substitutions. The cluster of 
identical and similar amino acids seen in the central portion of 
the 8-kDa proteins is seen here as well. 

Phylogenetic analysis was done with the 13 homologous pro- 
teins plus 13 of the 8-kDa proteins, from all four clades, which 
aligned over 62 amino acids, as determined with the ClustalX 



program. The analysis was done by quartet puzzling, the 
FITCH method, and the protpars method. All methods gave 
trees with similar topologies. The quartet puzzling results are 
shown as a radial tree in Fig. 3. The sequences fall into five 
clades. The T. solium 8-kDa clade is the closest in evolutionary 
distance to the cestode hydrophobic ligand-binding proteins. 
The next closest clade consists of two T. solium sequences, one 
of which encodes a cyst ice rcosis-speci lie antigen (GenBank 
accession number AF076609) and the other of which encodes 
an oncosphere-specific antigen (GenBank accession number 
AF523312), and one Taenia crassiceps sequence, which en- 
codes an immunodiagnostic antigen (GenBank accession num- 
ber U07I50). The remaining eight sequences, which formed 
two clades, were all E. granulosus antigen B sequences. The 
node support values for the five clades ranged from 84 to 
100%. 

lmmunoreactivities of T. solium 8-kDa proteins. Nine of the 
T. solium 8-kDa proteins were selected for evaluation of their 
immunoreactivities with 32 serum samples from patients with 
cysticercosis, 34 serum samples from patients with other par- 
asitic infections, and 15 NHSs. One sequence was chosen from 
each of the three clades Tsl4, TsRSl, and TsRS2 varl. Six 
sequences were chosen from the Tsl8 clade (Tsl8, Tsl8 varl, 
Tsi8 var3, Tsl8 var4, Tsl8 var6, and Tsl8 var8). Preliminary 
studies with Tsl8 var6, however, showed that it had minimal 
reactivity with the positive reference standard and most other 
positive serum samples (data not shown). Therefore, no fur- 
ther assays were done with this protein. The remaining eight 
synthetic proteins were assayed, each with 81 serum samples. 
The Fast-ELISA results are shown in Fig. 4. The cutoff value 
for each protein was set for 100% specificity, and the J index 
was calculated (47). The J index is equal to the sensitivity 
(number positive in the infected group divided by the number 
of individuals in the infected group) plus the specificity (num- 
ber negative in the uninfected group divided by the number of 
individuals in the uninfected group) minus 1. With a J index of 
1.00, TsRSl was 100% specific and 100% sensitive. Tsl8 varl 
and Tsl8 var3 both had J indices of 0.97 and missed 1 of 32 
positive serum samples. Tsl8 had a J index of 0.91 and missed 
3 of 32 positive serum samples. Tsl4 and Tsl8 var4 both had J 
indices of 0.84 and missed 6 of 32 positive serum samples. Tsl8 
var8 had a J index of 0.76 and missed 10 of 32 positive serum 
samples. TsRS2 varl had a J index of 0.73 and missed 12 of 32 
positive serum samples. The three proteins with J indices 
>0.97 show minimal reactivity with the NHSs and the serum 
samples from patients with other parasitic infections. Tsl4, 
Tsl8 var8, and TsRS2 varl had higher levels of reactivity with 
some of the serum samples from patients with other parasitic 
infections and some of the NHSs. This low level of cross- 
reactivity with cysticercosis-negative sera prevents the discrim- 
ination between positive serum samples with low-level reactiv- 
ity and negative serum samples. Tsl8 and Tsl8 var4 had low 
levels of reactivity with the serum samples from patients with 
other parasitic infections and the NHSs but lacked reactivity 
with several cysticercosis-positive serum samples. The best 
choices for antigens for use in a diagnostic assay are TsRSl, 
Tsl8 varl, and Tsl8 var3. All are 100% specific and have 
sensitivities ranging from 97 to 100%. 

The reactivities of the positive sera with Tsl8 varl were 
compared with the reactivities of the positive sera with Tsl8 
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FIG. 3. Phylogenetic tree for the T. solium 8-kDa proteins and related sequences. The analysis was done by quartet puzzling using a Clustal 
alignment of 62 amino acids from 26 sequences. The T. solium 8-kDa antigen clades TsJ4, Tsl8, TsRSl, and TsRS2 arc identified. The other 
sequences are identified by GenBank accession number, as follows: AF249884, H. diminuta hydrophobic ligand-binding protein; AF312736. M. 
expemsa hydrophobic ligand-binding protein; AF362443, AF362444, S41921, S41922, S41923, S41924, U15001, and AF252859, all E. granulosus 
antigen B; AF523312, T. solium oncosphere-specific antigen; U07150, 7*. crassiceps immunodiagnostic antigen; and AF076609, T. solium cystic- 
ercosis-specific antigen. The scale bar indicates an evolutionary distance of 0.1 amino acid per position. 



var3 (Fig. 5 A). The same comparison was made between TslS 
varl and TsRSl (Fig. 5B). No difference in the level of reac- 
tivity of each serum sample with Tsl8 varl and Tsl8 var3 was 
seen. The coefficient of determination (r 2 ) was 0.98. This result 
is not surprising, since the two sequences differ by only three 
amino acids. This result also validates the reliability of the 
ELISA data. However, when the reactivities with TslS varl 
and TsRSl were compared, several positive serum samples had 
low levels of reactivity with one antigen but high levels of 
reactivity with the other antigen. The positive serum sample 
with the lowest level of reactivity with Tsl8 varl and Tsl8 var3 
had a middle range of reactivity with TsRSl. Conversely, the 
positive serum sample with the lowest level of reactivity with 
TsRSl had a middle range of reactivity with Tsl8 var 1 and 
Tsl8 var3. These variations in antibody reactivities between 
these two antigens are reflected in the low coefficient of de- 
termination (r = 0.52). Because of these differences in levels 
of antibody reactivity between Tsl8 varl-Ts!8 var3 and TsRSl, 
both Tsl8 varl or Tsl8 var3 and TsRSl will be included in 
future assays for the detection of immunoreactivities to the T. 
solium 8-kDa diagnostic antigens. 

DISCUSSION 

The proteins in the T. solium 8-kDa family of proteins are 
important for immunodiagnosis. In the LLGP Western blot for 
cysticercosis, the diagnostic bands at 14, 18, and 21 kDa are 
8-kDa proteins. In addition, the band at 24 kDa contains 8-kDa 
proteins, and disulfide-bonded multimers of the 8-kDa pro- 
teins are found in the 39- to 42-kDa diagnostic band. To date, 
18 unique mature 8-kDa protein sequences have been identi- 



fied. Nine of the 18 protein sequences were synthesized for 
evaluation in an ELISA. Of these, one (TsRSl) was shown to 
be 100% sensitive for the detection of cysticercosis-positive 
serum samples reactive with the 8-kDa bands of LLGP on 
Western blot and 100% specific. Therefore, TsRSl will be one 
component of a cocktail of diagnostic synthetic or recombinant 
proteins. Another two 8-kDa proteins, Tsl8 varl and Tsl8 
var3, were shown to be 97% sensitive and 100% specific. In- 
terestingly, there were significant differences between the lev- 
els of antibody reactivities of cysticercosis-positive serum sam- 
ples with TsRSl compared with their reactivities with Tsl8 
varl or Tsl8 var3. To guard against the possibility of the lack 
of detection of a positive serum sample that has low level of 
reactivity with TsRSl, Tsl8 varl or Tsl8 var3 will also be 
included in the antigen cocktail. 

There are at least two additional immunodiagnostic proteins 
in the LLGP antigen used in the Western blot. These are the 
protein at 50 kDa (GP50) and the non-8-kDa proteins at 24 
and 39 to 42 kDa. While at least 90% of people with cysticer- 
cosis have antibodies that recognize more than one band in the 
immunoblot (40), the sera of some individuals recognize only 
GP50 and only the 24- or 42-kDa protein. Clearly, an assay that 
uses synthetic and recombinant antigens and that has the sen- 
sitivity of the LLGP Western blot assay must also include 
GP50 and the 24- and 42-kDa proteins. A recombinant antigen 
has been developed for GP50, and work is under way to de- 
velop a recombinant antigen for the 24- and 42-kDa proteins. 

The 8-kDa antigens are not the first cloned proteins to be 
tested for their immunoreactivities with cysticercosis-infected 
sera. Hubert et al. (19) identified two proteins, NC-3 and 
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FIG. 4. Antibody reactivities of T. solium 8-kDa proteins. Eight of the T. solium 8-kDa proteins were assayed with a panel of defined serum 
samples: 32 serum samples from patients positive (pos) for cysticercosis, 34 serum samples from patients with other parasitic infections (other), 
and 15 NHSs. Relative absorbance units (RAU) are shown on they axis. The J index for each antigen, calculated at 100% specificity, is shown. 



NC-9, by screening a cDNA expression library with cysticerco- 
sis-positive sera. They expressed these proteins as glutathione 
5-transferase fusion proteins and tested them in an ELISA. 
NC-3 had a sensitivity of 96.3% and a specificity of 91.5%. The 
sensitivity of NC-9 was only 33.3%. Chung et al. (5) cloned 
and expressed the T. solium homolog of the T. crassiceps 10- 
kDa immunodiagnostic antigen (GenBank accession num- 
bers AF076609 and U07150, respectively). These proteins 



are phylogenetically related to the T. solium 8-kDa antigens 
(Fig. 3). When expressed as a glutathione 5-transferase fu- 
sion protein and tested in an immunoblot, the T. solium 
10-kDa protein showed a sensitivity of 88% and a specificity 
of 98%. 

Aside from these proteins, two of which are unrelated to the 
T. solium 8-kDa proteins and one of which is related to the T. 
solium 8-kDa proteins, Sako et al. (34) cloned and expressed 
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FIG 5 Comparisons of the antibody reactivities of the cysticercosis-positive scrum samples. (A) Antibody reactivmes of cysticercosis-positive 
erum samples, in relative absorbance units (RAU), with Ts)8 varl compared to the antibody reactivities of cysticcrcosis- positive scrum samples 
vith TslS varl (B) Antibody reactivities of cysticercosis-positive scrum samples with TsI8 varl compared to the antibody reactivities ol 
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cysticercosis-positive serum samples with TsRSl. The r value for each line is shown. 



three 8-kDa proteins, Agl, AglVl, and Ag2 (GenBank acces- 
sion numbers AB044080, AB044081, and AB044082, respec- 
tively). Agl is a member of the TsRS2 clade, AglVl is in the 
TslS clade, and Ag2 is in the TsRSl clade (Fig. 2). They used 
thioredoxin- and histidine-tagged-fusion proteins and reported 
low levels of antibody reactivity with Agl and good antibody 
reactivities with AglVl and Ag2. For analysis in an ELISA, 
they expressed a histidine-tagged fusion of a chimera of AglVl 
and Ag2 and reported a sensitivity of 89.7% and a specificity of 
100%. These results are similar to the ones reported here. 
TsRS2 varl (GenBank accession number AF356343) was the 
least sensitive of the synthetic proteins tested, and TsRSl 
(GenBank accession number AF082830) was the most sensi- 
tive (Fig. 4). However, we report a sensitivity of 100% with 
synthetic TsRSl. 

While other cloned proteins function as immunodiagnostic 
proteins, such as NC-9 and the T. solium 10-kDa protein, there 
is an advantage to targeting the diagnostic proteins of an an- 
tigen mixture that has been used successfully around the world 
for more than 10 years. These proteins have a track record of 
success when used for immunodiagnosis, with a sensitivity of 
98% and a specificity of 100% (6, 12, 13, 40, 42, 46). Because 
it is known that the native proteins are glycoproteins, the one 
concern is the potential contribution of the carbohydrate moi- 
ety to the antibody response observed in cysticercosis patients. 
This concern was addressed by Obregon-Henao et al. (24), and 
their data showed diminished antibody activities for the native 
8-kDa proteins following deglycosylation. However, our data 
for the synthetic 8-kDa proteins and the data of Sako et al. (34) 
for the recombinant 8-kDa proteins clearly show that the ma- 
jority of the antibody reactivity is retained in proteins lacking 
glycosylation. 

To be able to implement cysticercosis control programs, it is 
essential to have an accurate and rapid assay for diagnosis. 
While the LLGP Western blot' assay is the "gold standard" 
assay for sensitivity and specificity (26a, 28), an ELISA format 
is much preferred because of its ease of use and lower cost. 
The use of cloned antigens rather than partially purified native 
proteins makes use of the ELISA format feasible and opens 
the possibility for the use of other rapid test formats, such as 



lateral flow assays. In addition, the use of a cloned protein as 
antigen removes the dependence on a supply of naturally in- 
fected pigs as a source of parasite material. The ability to 
chemically synthesize the antigen rather than express it greatly 
simplifies the process for obtaining antigen and keeps the cost 
low. The cost of synthetic TsRSl antigen required to assay a 
single sample is $0.03. It may be possible to reduce this cost 
further by epitope mapping TsRSl to determine if a smaller 
peptide(s) retains the same sensitivity for the detection of 
cysticercosis-positive sera. 

The 8-kDa proteins are a family of secreted proteins that 
elicit an antibody response in infected people. The question 
arises, what is the function of these proteins in the metaces- 
tode? The best clue that we have is that phylogenetically they 
are the proteins most closely related to the cestode hydropho- 
bic ligand-binding proteins, characterized in M. expansa (Gen- 
Bank accession number AF312736), the sheep tapeworm, and 
in H. diminuta (GenBank accession number AF249884), the 
rat tapeworm (2, 20, 33) (Fig. 3). Like the cestode hydrophobic 
ligand-binding proteins, the 8-kDa proteins all have a con- 
served tryptophan that has been shown to be essential to the 
function of the cestode hydrophobic ligand-binding proteins 
(2, 32). This conserved tryptophan is not present in the mem- 
bers of the three other clades of phylogenetically related pro- 
teins (Fig. 3). On the basis of these similarities, we believe that 
the 7*. solium 8-kDa proteins may also be hydrophobic ligand- 
binding proteins, but ones that function as extracellular carrier 
proteins for lipids and other hydrophobic ligands. However, 
the possibility that the 8-kDa proteins may play a role in the 
host immune response, like the role proposed for the related 
protein antigen B from £. granulosus, should also be consid- 
ered (30). 

In the host-parasite interactions that occur during infection 
with T. solium metacestodes, it is clear that the 8-kDa proteins 
elicit an antibody response that is diagnostic for cysticercosis. 
In order to determine which flavor of the 18 8-kDa proteins 
identified to date works best as a diagnostic antigen, 9 of the 
66- and 67-mer mature 8-kDa proteins were chemically syn- 
thesized and then evaluated in an ELISA. One, TsRSl, was 
100% sensitive when it was tested with cysticercosis-positive 
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sera reactive with the 8-kDa proteins on Western blot and was 
100% specific. Two others, Tsl8 varl and Tsl.S var3, were 97% 
sensiiive and 100% specific and often showed different levels of 
antibody reactivity with cysticercosis-posilive sera compared to 
the reactivity of TsRSI. TsRSl and either Tsl8 varl or Tsl8 
var3 will be components of a diagnostic antigen cocktail of 
three to four synthetic and recombinant proteins, all derived 
from the diagnostic proteins used for the LLGP Western blot 
assay. Identification of these diagnostic antigens and the de- 
velopment of the ELISA are major steps forward for the se- 
rodiagnosis of eyslicercosis. 
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